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Abstract 

The Sudoku puzzle has achieved worldwide popularity 
recently, and attracted great attention of the computational 
intelligence community. Sudoku is always considered as 
Satisfiability Problem or Constraint Satisfaction Problem. 
In this paper, we propose to focus on the essential graph 
structure underlying the Sudoku puzzle. First, we formalize 
Sudoku as a graph. Then a solving algorithm based on 
heuristic reasoning on the graph is proposed. The related 
r-Reduction theorem, inference theorem and their properties 
are proved, providing the formal basis for developments of 
Sudoku solving systems. In order to evaluate the difficulty 
levels of puzzles, a quantitative measurement of the complex- 
ity level of Sudoku puzzles based on the graph structure and 
information theory is proposed. Experimental results show 
that all the puzzles can be solved fast using the proposed 
heuristic reasoning, and that the proposed game complexity 
metrics can discriminate difficulty levels of puzzles perfectly. 

1. Introduction 

The Sudoku puzzle has become popular in Japan since 
1986, and became world-wide popular in 2005. The com- 
pletion rules of the puzzle are rather simple, however the 
required reasoning techniques for solving the puzzle are 
often complex. Because Sudoku puzzle is a good example 
of various reasoning approaches in Artificial Intelligence 
area, it has attracted great attention of the computational 
intelligence community lITl Bl iTTTl ifTUl |[T3l lfl4l . 

Fig. [Tj is an example of the Sudoku puzzle. The left part 
provides a Sudoku puzzle, and its solution is shown on the 
right. We define the notations as follows: 

Definition 1: The Sudoku puzzle is a 9 x 9 grid, which 
comprises nine 3x3 sub-grids, which comprises 3x3 cells. 
Some cells are filled with numbers from 1 to 9, whereas 
others are left blank. 

Definition 2: The Sudoku puzzle is solved by filling all 
the blank cells with numbers from 1 to 9, such that every 
row, every column and every sub-grid contains each of 
the nine possible numbers. Sudoku puzzle has exactly one 
solution. 

There are mainly three directions in the Sudoku puzzle 
studies: 



1. Sudoku generation. One question is how to generate 
randomly a Sudoku puzzle which has exactly one solution. 
Another question is the estimation of the total number of 
the possible Sudoku puzzles 1 2 1 1 3l ifTTI . 

2. Sudoku solving. Up to now, most of the studies are on 
this direction. People proposed several practical reasoning 
algorithms based on different knowledge representations. 
ifTSl shows that the generalized Sudoku puzzle is NP- 
Complete. 00IHH encode Sudoku as a Satisfiability Prob- 
lem (SAT), and use a general SAT solver to get the solution. 
ifTOllTOl represents the Sudoku puzzle as a Constraint Satis- 
faction Problem (CSP), and compares different propagation 
schemes for solving Sudoku. However, (5) reported that 
some hard puzzles cannot be solved using these methods, 
such as ||6)||9). In this paper, we will formalize Sudoku as 
a graph, and develop a solving algorithm based on heuristic 
reasoning on the graph. 

3. Sudoku complexity evaluation. Most of the Sudoku 
designers classify the difficulty levels of Sudoku puzzles as, 
for instance, "easy", "intermediate" or "hard" (or levels 1, 
2, 3) according to their experiences. However, up to now 
there are few publications on this subject. [7 1 only mentioned 
that the difficulty level depends on the number of initial 
non-blank cells provided. This is a very rough evaluation, 
and we will propose a more accurate quantitative metric 
for difficulty evaluation based on the complexity of the 
underlying graph structure. 

This paper deals with the second and third aspects, in 
order to provide a formal basis for the studies and develop- 
ments of Sudoku solving systems and difficulty evaluation. 
In section II, the graph structure underlying Sudoku puzzle 
is defined. In section III, heuristic reasoning on the graph is 
proposed, and its related theorems, properties and algorithm 
are explained in detail. Section IV focuses on how to 
evaluate the game complexity of the Sudoku puzzle based 
on information theory. To assess our theory, experiments 
are performed and the results are discussed in section V. 
Conclusion and future works are presented at the end. 

2. The Sudoku Puzzle and Graph 

The Sudoku puzzle is essentially a directed graph. The 
cells are vertices, and the constraints are directed edges. Fig. 
[2] shows part of the Sudoku graph related to cellos, of the 
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Figure 1 . 
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Figure 2. Part of the Sudoku Graph 

puzzle in Fig. [T] The cells in the same row, the same column 
and the same sub-grid as cell/g, are vertices in the subgraph, 
and the edges represent the constrain relationships between 
and the connected cells. Notice that the full graph is 
constructed by 9 x 9 such subgraphs that are connected to 
be the entire graph. Initially, cell^ is filled with number 6, 
so we say that the set of possible values of the vertex is 
v 45 = {6}, and the filled value is C45 = 6. 

In Fig. [T] the vertex V13 may be filled with 1, 2, or 4, 
so the candidates set is u 13 = {1,2,4}, and the size of 
the set is \vxa\ = 3. In the reasoning process, if we find 
that cell^i should be filled with 1, then we can remove 
1 from the candidates set 1/13 (they are connected by an 
edge £31.13), so 1/13 = {2,4}. That is, the information of 
possible candidates in each cell is propagated through the 
edges on the graph, and change the values of candidates 
sets of connected vertices. The graph underlying the Sudoku 



puzzle is formally defined as follows. 

Definition 3: The Sudoku alphabet is = 

{1,2,3,4,5,6,7,8,9}. 

Definition 4: The vertex candidates set is = {x\x £ 
J2, and celkj is possible to be filled with the number x with 
the information we have}. 

Definition 5: If \vij\ = 1, the vertex celkj is filled with 
the number £ My. 

Definition 6: The Sudoku graph is G = (V,E), where 
the vertices set V = {celkj\celkj is the cell in i-th 
row and j-th column, 1 < i,j < 9}, the set of edges 
E = {(cellij,cellki),(cellki,cellij)\ there is the constraint 
Cij ^ cm between celkj and cellu}- For simplification, edge 
(cellij , cellki) is noted as eij^i- 

Definition 7: Initiation of Sudoku Graph: V cellij, if 
cellij is filled with number k in the puzzle as initial, then 

Cij = k, Vij = {&}■ 

Both the solving algorithm and game complexity proposed 
in this paper are based on the graph structure. Ideally, player 
can solve the puzzle in this way: player starts from any 
one of the vertices, traverses all the vertices once and only 
once, along with the directed edges. Each time when the 
player arrives at a vertex celkj, he or she want to choose 
the correct number x € to fill the cell c^ = x, and output 
the number. The player continues to traverse the remaining 
vertices by choosing one of the edges, until all the vertices 
are visited. At the end, the output of numbers is the solution. 

However, how to "choose the correct number x £ Vij" 
is the essential uncertain problem. In the next section, a 
heuristic reasoning approach is proposed to achieve this goal. 
And the difficulty level of the Sudoku puzzle is evaluated 
based on the concept of "uncertainty of choice" at each 
vertex of the graph, since it is related to the magnitude of 
uncertainty in the reasoning. 

3. Heuristic Reasoning on the Graph 

In order to present the heuristic reasoning on the Sudoku 
graph, we define the concept of "section" of a Sudoku graph. 



Definition 8: Section S is a complete subgraph in the 
Sudoku graph G. 

"Complete subgraph" is a basic concept from discrete 
mathematics. The definition means that a section is a 
subgraph of graph G, such that every two vertices in a 
section are directly connected by two directed edges of 
opposite directions. The vertices in a row, a column or a 
sub-grid construct totally 27 "sections". Moreover, they are 
the only sections in a Sudoku graph. Each vertex belongs to 
3 sections. 

For heuristic reasoning, the most obvious fact is that if a 
cell is filled with a number, then the cells in the same section 
cannot be filled with the same number. So this number can be 
removed from the candidates sets. Because we only remove 
one element from the sets, the process is called 1 -Reduction. 

Theorem 9 {1 -Reduction): k, I, if 3|i>y| = 1, and 
there is an edge e^n, then c k i £ v M 

proof: According to Definition [5] 



cm G v M 

There is an edge ey^j, according to Definition [6] 
cm- So, 

Cfcz G v kl - {cij} 
\ = 1, so Vij = {cij}, we conclude c k i G vm — 



(1) 

(2) 

Using this theorem, once a cell is filled with number cy, 
then we can remove the number from the candidates sets of 
the connected vertices. 

However, sometimes we are not able to find such a vertex 
having |uy| = 1, but we are able to find r vertices that 
share r possible numbers. In this case, we can remove the 
r numbers from the candidates sets of the other vertices 
that are in the same section as the r vertices. Notice that 
r numbers are reduced from the sets. To formalize the 
information propagation on the graph, we have a more 
general reasoning theorem: 

Theorem 10 (r-Reduction): In any section S/., if Bcelli, 
cell2, cell r £ Sk such that \v\ UU2U ... U v r \ — r, then 
Wcella £ Sk and cellij (£ {celh, cell 2 , cell r }, we have 



% 3 ^- LJk emu ^^i/vij 
Cij £ Vij — Vi U V 2 U ... U V r . 

proof: Vertices cellx, cell 2 , ■ 



celhj are in the same 



section, so cx,c 2 , ...,Cr and cy are all different, 

\{c 1 ,c 2 ,...,c r }\ = r 

Cij £ - {ci,C 2 , ...,Cr} 

According to Definition [5] 

{ci, c 2 , Cj.} C vi U v 2 U ... U v r 

So, 



(3) 
(4) 

(5) 

\{ci,c 2 , ■■■,c r }\ < \vi Uv 2 U ... U v r \ (6) 
with equation ([3]) and assumption in the theorem, we have, 
r = |{ci,c 2 , ...,c r }| < \vi Uv 2 U ... U v r \ = r (7) 



it implies that in equation (|5]), 

{ci,c 2 , ...,c r } = vi U v 2 U ... U w r 
substitute for equation Q, we conclude: 

c^j £ — vi U v 2 U ... U v r 



(8) 



(9) 

□ 

This theorem allows player to remove a set of numbers 
from the candidates set of a vertex, depending on the 
candidates sets of connected vertices in the same section. 



Theorem 10 is a very strong reduction rule for the Sudoku 
puzzle reasoning. In experimental results, we will show 
that it can solve nearly all the Sudoku puzzles, except 
some very hard puzzles. Because these very hard puzzles 
provide less redundant pieces of information for heuristic 
reasoning [5|. In order to solve all the puzzles, we have 
to add the following Inference Rule. This theorem allows 
player to make additional assumptions, which can increase 
the redundant information for reasoning. 



Theorem 11 (Inference): Vv, 



{ci, c 2 , ...}, if we set 



Vij = {ck} C {ci,c 2 , ...}, then if r-Reduction can reduce 
the puzzle to a solution, then we conclude cy = c k . 
proof: As we know, cy £ v^ is a single number. The puzzle 
has exactly one solution, so cy is in the solution. Because 
we have set «y = {ck}, cellij is surely filled with Cfc in the 
solution. So we conclude cy = c^. □ 



Based on Theorem 10 and Theorem 11 we propose the 
following algorithm that can solve all the Sudoku puzzles 
in nearly polynomial time. 

Algorithm 12 (Algorithm for solving Sudoku): 
Initiation: set global variables vu, W99, according to the 
filled cells in Sudoku. 

bool Solver() 
{ 

while(l) 
{ 

Reduction(); 

if (Sudoku is not solved) 

Inference(); 
else if (Sudoku has no solution) 

return false; 
else return true; 



} 



Reduction() 
{ 

r=l; 

while(r < 9){ 
r-Reduction; 

if(no candidates set is reduced) 
r++; 



} 



else r=l; 
} 



H = -§T E« log = - log i - i E« log 



Inference() 
{ 

Select randomly a vertex cei/^ which was not selected 

and has \vij\ > 1; 
for each x £ Vij 

{ 



set c. 



'7 



x: 



} 



if( Solver() ) return; 



} 

As we mentioned in Theorem [TUJ the "r-Reduction" 
statement in function "Reduction()" tries to find r vertices 
that share r numbers for reduction, for each section in the 
27 sections. Its computational complexity is polynomial. 

Notice that if we only use the inference rule, the reasoning 
process actually generates a backtracking inference tree, 
with exponential complexity. However, if r-Reduction can 
solve puzzles with few inferences (e.g., 1 or 2 times), the 
practical computational complexity is still nearly polyno- 
mial. As we can learn from the algorithm, we only use the 
inference as an assistant of r-Reduction. As we will show in 
the experiments, the practical computational complexity is 
almost equal to the polynomial complexity of r-Reduction. 

4. On the Game Complexity of Sudoku 

The game complexity of Sudoku depends on the initial 
distribution of the numbers in the cells Q. This section 
aims at providing a quantitative metric to assess the com- 
plexity without any playing. In lfl2l . information entropy 
is proposed to evaluate the uncertainty of signals sent by 
an information source. The "uncertainty" concept is similar 
to the uncertainty of choosing the correct number for the 
vertex when the player is traversing the Sudoku graph, by 
considering the possible choices on a vertex as the possible 
signals to be sent in a state of information source. So 
we propose to evaluate the complexity of solving Sudoku 
puzzles using the information entropy. 

On the player's viewpoint, there are \vij \ possible choices 
for each vertex celkj. At the first step, the player has to 
choose one from the \vij\ possible choices. Then, at the 
second step, the player chooses to go to one of the 20 
connected vertices in the same row, column and sub-grid 
(see Fig. |2j. Taking the two steps together, the player has 
20 x \vij \ choices ((cfc, eij,i m ) pairs). The choices are of the 
same probability, because the player can only try them one 
by one without additional hints. So for each vertex v^, the 
entropy is H tJ = - log 

For the entire graph, the entropy is the average entropy 
of all the 81 vertices: 



The first part "— log ^j" of the formula represents the 
intrinsic complexity of the puzzle which is associated with 
the structure (20 connected vertices). Then the later part of 
the formula expresses the complexity related to the distri- 
bution of the initial values. For simplification, we remove 
the constant part — log ^ from the equation, and define the 
Sudoku entropy and the game complexity as follows. 
Definition 13: The Sudoku entropy is defined as 



(10) 



Definition 14: The game complexity is the Sudoku En- 
tropy after initiation plus one 1 -reduction. 

The difficulty for players to solve the puzzle is based 
on the Sudoku entropy. The game complexity for players 
is the Sudoku entropy after the initiation of the graph, and 
removing the numbers from vertex candidates set which are 
obviously impossible choices. 

For example, in Fig.[T] after initiation and one 1-reduction, 
ceZ?3i has the candidates set {1,2} , so H31 = — log i. 
Another vertex celli3 has the candidates set {1,2,4}, so 

Sl3 = -l0gi 

5. Experimental Results 

In this section, we show the effectiveness of our 
algorithm for solving Sudoku and of the quantitative 
metrics for Sudoku complexity estimation. 

Up to now, since difficulty levels are defined by experts, 
there is no tool based on an analytical approach for Sudoku 
estimation. So we have created test sets of three difficulty 
levels. The "easy" level and the "intermediate" level include 
both 24 instances from the first 3 books of easy and 
intermediate Sudoku puzzles respectively in [4|. 

According to Q, there are no known puzzles with 16 pre- 
assigned cells, because they lead to more than one solution. 
So the puzzles with only 17 pre-assigned cells are the 
most difficult puzzles. When additional cells are assigned, 
the information redundancy increases, and the complexity 
decreases. [8| contains 47,621 Sudokus of exact 17 pre- 
assigned cells. Our "hard" level data set includes 10,000 
puzzles from them. 

Using the reasoning algorithm and the game complexity 
metric based on entropy, the results of the 3 levels are 
compared in Table [T] The r-Reduction rule is strong enough 
to solve all the "easy" and "intermediate" puzzles, and 
70.5% of the "hard" puzzles. 

The values of game complexity discriminate the actual 
difficulty levels of puzzles perfectly. The easy puzzles have 
game complexity values in the domain [0.5362, 0.9046], 
which are all less than the values of intermediate puzzles 



in the domain [1.2246, 1.3965], which are all less than 
the values of hard puzzles in the domain [1.6946, 1.8189]. 
Therefore, our quantitative estimation is highly in accordant 
with the evaluation of the experts who design the puzzles. 





Easy 


Intermediate 


Hard 


Solved (r-Reduction only) 


100% 


100% 


70.5% 


Solved (r-Reduction + Inference ) 


(100%) 


(100%) 


100% 


Minimum Game Complexity 


0.5362 


1.2246 


1.6946 


Maximum Game Complexity 


0.9046 


1.3965 


1.8189 


Average Game Complexity 


0.7439 


1.3049 


1.7526 



Table 1 . Results on 3 levels 



The detailed results of "hard" puzzles are reported in 
Table [2] If we only use r-Reduction, 70.5 percent of the 
puzzles can be solved. If the inference rule is allowed, all 
the puzzles can be solved with only average 1.6 inferences 
for a puzzle. So, as explained in section 3, this keeps 
the practical computational complexity of the algorithm in 
nearly polynomial time. There is nearly no difference on 
time consumed between the situations when the inference 
rule is enabled or not. 

The average game complexity of the unsolved puzzles 
is greater than that of the solved puzzles, which proves 
again that our metric can discriminate the difficulty levels 
of Sudoku puzzles. 

People are trying to produce more diabolic puzzles to 
challenge the reasoning ability of computers. For example, 
|5| reported that many solvers and algorithms fail to solve 
the problems from (6) 0, such as "AI Escargot" puzzle. 
Using our algorithm, the computer can solve "AI Escargot" 
with only 91 inferences, which is claimed to be "the most 
difficult Sudoku puzzle known so far". Moreover, the 21 
puzzles from [9] are solved with 2442 inferences, average 
116 inferences for one puzzle. These results prove that our 
algorithm can solve all the known puzzles at any difficulty 
level with high efficiency. 

6. Conclusion and Future Works 

In this paper, we provided a formal basis for studies and 
developments of Sudoku solving systems and complexity 
evaluation. We defined the formal graph-based knowledge 





Solved 


Unsolved 


IT* 


AI* 


r-Reduction 
only 


7050(70.5%) 
GC* 1.75226 


2950(29.5%) 
GC* 1.75349 








r-Reduction 
+ Inference 


10000(100%) 





16034 


1.6034 



IT: Inference limes. AI: Average Inferences. G(J: Game Complexity. 



Table 2. Results of hard puzzles 

representation of the Sudoku puzzle, and proposed a heuris- 
tic reasoning algorithm based on the theorems of r-Reduction 



and Inference. The r-Reduction theorem is a very strong 
heuristic reasoning rule for Sudoku puzzles, and we have 
showed that it can solve most of the puzzles itself. In order 
to solve the puzzles more complex, we proposed to add 
the inference theorem as assistant in the algorithm. Because 
the r-Reduction rule is strong enough, few inferences are 
needed. Even for the hard puzzles, we only need to infer 1.6 
times for each puzzle. This keeps the practical computational 
complexity of the algorithm in almost polynomial time. 

It is believed that all the Sudoku puzzles can be solved 
with only heuristic reasoning, without backtracking infer- 
ences. However, if only using r-Reduction, 70.5% of the 
hard puzzles are solved. So, for the future works, we are 
trying to find a stronger rule by extending r-Reduction, in 
order to solve all the puzzles using a single rule. 

In fact, we developed an approach based on heuristic 
reasoning on certain discrete structure (graph), to solve a 
constraint satisfiability problem. This approach may provide 
a new method and concrete example for solving constraint- 
based problems. And it could also lead to new techniques 
for developing tutoring systems for Sudoku learners (T). 

We also proposed game complexity to quantitatively mea- 
sure the difficulty levels of Sudoku puzzles. The concept can 
be interpreted very well in the players' viewpoints, and it 
is also shown to be effective in experiments. It could also 
provide a potential measurement of uncertainty in reasoning. 
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